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COMPARISONS  OF  THE  PERCENTAGE  POINTS  OF  DISTRIBUTIONS 
WITH  THE  SAME  FIRST  FOUR  MOMENTS,  CHOSEN  FRCM  EIGHT 
DIFFERENT  SYSTEMS  OF  FREQUENCY  CURVES 

by 

E.S.  Pearson,  University  College,  London 
N.L.  Johnson,*  University  of  North  Carolina  at  Chapel  Hill 

and 

I.W.  Burr,  Purdue  University 

1.  PURPOSE  OF  THIS  INVESTIGATION 

Oar  object  is  to  study  the  extent  to  which  the  probability  integrals  of 

members  of  different  systems  of  imvariate,  unimodal  frequency  curves,  y = f(x), 

having  identical  first  four  central  moments,  are  in  agreement.  If  the  variables, 

x,  are  standardized  so  as  to  have  a zero  mean  and  unit  standard  deviation,  we 

are  proposing  to  investigate  to  what  extent  the  two  "shape"  parameters 
3 4 

/&1  = P 3/0  and  ~ P4/0  * often  described  as  measuring  skewness  and  kurtosis, 
can  provide  estimates  for  several  different  systems  of  distributions,  of  the 
probability  integrals 

x 

P = / f(x)dx  . 

-OO 

2.  HISTORICAL  SUMARY  OF  THE  DEVELOPMENT  AND  USES  OF  FREQUENCY  CURVES 
2.1.  Fitting  to  observational  data 

It  was  realized  towards  the  end  of  the  19th  century  that  the  frequency 
distributions  of  many  series  of  observed,  continuously  distributed  data  could 
not  be  adequately  represented  by  the  normal  or  Gaussian  probability  law,  for 
which  /Bj  = 0,  B2  = 3.0.  Two  systems  of  non-normal  distributions  were  then  put 
forward: 

K 
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(a)  The  Gram-Charlier,  or  rather  similar  Edgeworth  systems  depending  on 
series  expansions  involving  the  normal  function  and  its  derivatives. 

(b)  The  Pearson  system,  based  on  the  solution  of  the  single  differential 


equation 


- (Cj+X) 


7 c0  + clx  + C2X 

For  both  systems  the  parameters  of  the  curves  are  expressible  in  terms  of  the 
mean  yj  (which  for  a standardized  variable  is  zero)  and  the  higher  moments  about 
the  mean,  y ^ (i  = 2,3,4,...).  In  graduating  an  observed  frequency  distribution 
by  a theoretical  curve  it  was  for  long  the  practice  to  equate  the  moments  of  the 
latter  to  those  of  the  former  distribution.  While  the  Pearson  curves  of  system 
(b)  require  no  more  than  the  first  four  moments,  the  series  expansions  of  system 
(a)  allow  for  the  introduction  of  more  moments  and  therefore  might  be  expected 
to  provide  a closer  fit.  However,  in  practice,  this  possibility  provides  little 
advantage  when  attempting  to  graduate  observed  data  subject  to  sampling  errors, 
because  the  higher  moments  of  the  data,  nr , are  subject  to  standard  errors  increas- 
ing rapidly  with  i. 

If  we  compare  like  with  like,  e.g.  use  only  four  moments  in  either  system,  we 
find  as  shown  by  Barton  and  Dennis  (1952)*  that  outside  a rather  restricted  region 
in  the  Bp  S2  field,  the  Gram-Charlier  and  Edgeworth  curves  may  cease  to  be  posi- 
tive definite  and  unimodal.  Although  we  shall  not  be  concerned  in  this  report 
with  the  use  of  frequency  curves  in  graduating  observational  data,  this  weakness, 
as  well  as  pressure  on  time  and  space,  influenced  us  in  deciding  to  exclude  curves 
of  the  systems  (a)  from  our  investigation. 

2.2  New  uses  for  systems  of  frequency  curves 

Apart  from  using  mathematical  curves  to  graduate  observed  frequency 
distributions,  it  was  realized  that,  in  the  development  of  statistical  theory, 

See  Draper  and  Tierney  (1972)  for  further  comnents  on  this  region  and  some 
additional  points. 
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these  curves  had  two  other  very  useful  functions: 

(a)  They  could  be  used  to  represent  approximately  the  sampling  distribution 
of  a statistic  in  cases  where  the  true  distribution  was  difficult  to 
derive  explicitly,  but  its  moments  were  known  or  at  any  rate  calculable. 

(b)  They  could  be  used  to  represent  population  distributions  in  studies  of 
the  robustness  of  tests  and  in  procedures  of  estimation  which  had  been 
based  on  the  assumption  of  parental  normality. 

The  pioneer  work  in  the  direction  (a)  seems  to  have  been  taken  by  Student 

(W.S.  Gosset)  who  in  his  investigation  (1906)  on  how  to  treat  the  mean  and  variance 

in  very  small  experimental  sanples,  where  the  variables  could  be  assumed  to  be 

normally  distributed,  took  a number  of  illuminating  steps: 

(i)  first  he  derived  the  3rd  and  4th  moments  of  the  sample  estimate  of  var- 
2 2 

iance,  s , the  expectation  of  s and  its  2nd  moment  being  already  known; 

2 2 

(ii)  then  he  realized  that  the  values  of  6^(s  ) and  62 Cs  ) were  those  of  a 
Pearson  Type  III  or  gamma  distribution; 

7 

(iii)  assuming  this  last  to  be  the  true  distribution  of  s , and  having  shown 

_ 2 

that  in  samples  from  a normal  population,  x,  the  mean  and  s were  inde- 


pendent, he  deduced  the  sampling  distribution  of  z = t//v  = (x  - pJ)>Ws' 

2 2 

(where  v = n-1  and  s'  = Z(x-x)  /n)  and  found  this  to  be  a Pearson  Type 


VII  curve.  Student  was  unaware  that  Abb6  and  Helmert  had  previously 
proved  mathematically  that  s unquestionably  had  this  Type  III  form,  a 
result  which  Fisher  also  confirmed  in  1915.  However,  Student’s  line  of 
approach  is  one  which  has  since  been  followed  where  no  true  distribution 
is  known,  only  the  moments. 

9 

A number  of  years  later  it  was  again  Student  (1927)  who  broke  fresh  ground 
by  calculating  a table  of  approximate  upper  10,  4 and  2 percentage  points  of  the  — 
range  (w)  in  samples  of  n = 2(1)10  from  an  N(0,1)  population,  using  Pearson  curves 
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having  the  moments  recently  published  (Tippett,  1925;  Pearson,  1926).  A fuller 
and  more  accurate  table  of  percentage  points  for  range,  using  the  best  available 
estimates  of  the  moments  to  be  used  in  fitting  a Pearson  curve,  was  published 
a few  years  later  (Pearson,  1932),  and  when  accurate  percentage  points  were 
computed  ab  initio  (Pearson  and  Hartley,  1942)  it  was  realized  how  closely  the 
"Pearson  curve  fitting"  procedure  had  given  the  true  values. 

At  about  the  same  time  as  these  approximations  to  the  distribution  of  the 
range  were  being  developed,  one  of  us  (Pearson,  1930,  1931)  had  used  the  work  of 
Fisher  (1928,  1929)  and  Wishart  (1930)  to  derive  approximations  to  the  lower  and 
upper  5 and  1 percent  points  of 

Vb^  = m^/s^  and  b2  = m^/s^ 

in  samples  of  n from  a normal  population,  using  Pearson  Type  VII  and  Type  IV  curves 
having  the  correct  first  four  moments.  The  results  were  only  given  for  relatively 
large  samples,  i.e.  forjb n > 50,  and  for  b^,  n > 100. 

When  Johnson  (1949)  developed  his  new  system  of  Sg  and  Sy  frequency  curves 
he  used  a rather  different  method  of  exploring  the  similarity  between  Pearson  and 
Johnson  curves  having  the  same  first  four  moments.  His  procedure  was  to  calculate 
the  expected  group  frequencies  of: 

(a)  An  Sg  and  a Pearson  Type  I curve,  both  fitted  using  the  same  four  moments 
to  an  observed  frequency  distribution*  of  N = 631,682  observations  for 
which  /bj  = 0.318,  b2  = 2.430. 

(b)  An  Sy  and  a Pearson  Type  IV  curve  fitted  in  the  same  way  to  an  observed 
distribution  of  N = 9440  observations*  for  which  /b^  = 0.910,  b2  = 4.863. 

(c)  An  Sy  and  a Pearson  Type  IV  curve  fitted  similarly  to  another  distribu- 

r | 

tion  of  N = 9440  observations*  for  which  7 b^  = 0.441,  b2  * 3.654. 

I ' 

All  three  distributions  were  taken  from  Pretorius  (1930). 
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If  alternatively  we  make  use  of  the  Tables  Al,  A2,  A3,  A4  discussed  below,  we 
see  that  with  /$^  =s/b^  and  B2  ~ t>2’  t^ie  differences  between  all  percentage  points 
of  Pearson  and  Johnson  curves  are  < 0.01  (i.e.  1/100  of  the  S.D.)  in  the  stretch 
between  and  including  the  lower  and  upper  2.5%  points  for  cases  (a)  and  (b)  and 
as  far  out  as  the  lower  and  upper  1%  points  for  case  (c) . This  showed  how  similar 
the  Pearson  and  Johnson  curves  are  in  these  three  examples,  except  towards  the 
tails. 

Merrington  and  Pearson  (1958)  introduced  a further  family  of  probability  dis- 
tributions into  the  comparison  by  examining  how  closely  the  distribution  of  non- 
central t could  be  represented  by  a Pearson  curve  of  Type  IV. 

With  the  information  from  these  rather  diverse  comparisons  before  them, 

Pearson  and  Tukey  (1965)  decided  to  explore  the  possibility  of  a different  pro- 
cedure, that  of  estimating  the  standard  deviation  of  a distribution  by  applying 
factors  to  what  they  termed  the  "h%  distances,"  i.e.  the  distances  between  the 
lower  and  upper  h%  points  of  a distribution  which  had  not  been  standardized. 

The  slowly  changing  values  of  these  factors  were  shown  by  drawing  systems  of  con- 
tours in  the  &2  plane,  see  their  Figs.  2,  3 and  4 for  h = 5.0,  2.5  and  1.0, 
respectively.  In  the  course  of  examining  their  problem  they  calculated  afresh  or 
collected  from  elsewhere  the  0.5,  1.0,  2.5  and  5.0%  points  of  29  distributions 
selected  from  the  Pearson,  Johnson,  log-normal,  log  x2  and  non-central  t distri- 
butions. 

Similar  comparison  of  standardized  % points  have  been  made  elsewhere,  e.g. 
in  Pearson  (1963)  and  in  Pearson  and  Hartley’s  Biometrika  Tables  for  Statisticians, 
Vol.  2 (1972,  p.  76).  The  existence  of  the  tables  of  standardized  percentage 
points  of  several  families  of  frequency  distributions  (see  Appendix  for  refer- 
ences) as  well  as  the  availability  of  worked  out  computer  programmes  has  made 
these  comparisons  much  easier  to  carry  out  than  formerly.  For  this  reason  it 


, / < 


seemed  to  us  that  the  time  had  cane  for  systematizing  and  extending  these  com- 
parisons, as  well  as  filling  in  certain  gaps. 

3.  THE  SCOPE  OF  THE  PROGRAMME  UNDERTAKEN 
3.1.  The  starting  point. 

It  may  be  said  that  there  are  two  aspects  of  the  subject: 

(a)  Its  interest  as  disclosing  some  rather  unexpected  properties  of 
univariate  frequency  distributions.  For  instance,  the  two-decimal 
place  comparison  shown  in  Table  1 of  standardized  5 and  1 percent  points 
of  members  of  five  distinct  families  having  beta  values  close  to 

/B^  = 0.8,  f?2  = 4.2,  inevitably  raises  the  question:  over  what  area  in 
the  beta  field  does  this  degree  of  correspondence  exist? 

(b)  What  use  can  be  made  of  our  results  in  practical  or  theoretical  research 
in  mathematical  statistics? 


TABLE  1.  Illustration  of  comparisons 


- — -z.  T-=-  z.—  • : . ■ 

1 Shape  parameters 

Standardized  percent  points 

Family 

/gl 

e2 

Lower  1 

Lower  5 

Upper  5 

Upper  1 

Pearson,  Type  VI 

0.800 

4.200 

-1.80 

-1.40 

1.82 

2.90 

Johnson,  Sy 

.800 

4.200 

-1.80 

-1.40 

1.82 

2.91 

Not -central  t 

.780 

4.229 

-1.83 

-1.42 

1.81 

2.89 

Log-normal 

.814 

4.200 

-1.78 

-1.40 

1.83 

2.91 

Log-x2 

.780 

4.188 

-1.83 

-1.41 

1.81 

2.90 

, 

Note.  Published  tables  for  Pearson  and  Johnson  curves  give  values  of  the  % points 
at  exactly  /B^  = 0.8,  = 4.2;  this  beta-point  is  outside  the  non-central 

t area;  no  log-normal  or  log*x2  distributions  have  /Bj  = 0.8,  B£  = 4.2 
exactly. 
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Throughout  the  following  analysis  /g^  rather  than  g^  has  been  taken  as  the 
argument  for  skewness;  this  was  because,  to  ease  interpolation,  the  former  has 
been  used  in  a number  of  tables,  e.g.  of  the  standardized  percentage  points  of 
Pearson  curves.  This  advantage  has  to  be  balanced  against  certain  disadvantages, 
e.g.  (a)  the  regional  boundaries  of  a chart  like  Fig.  1 cease  to  be  linear  or 
near-linear;  (b)  the  Pearson  Type  I and  Sg  areas  are  cramped  for  space  compared 
with  those  for  Type  IV  and  S^.  As  a result,  if  calculations  are  made  using  a 
grid  having  equal  intervals  in  terms  of  /g^  it  is  impossible  to  follow  changes 
in  such  detail  in  the  Type  I-Sg  as  in  the  Type  IV-Sy  area.  Whether  this  matters, 
depends  on  where  our  interests  are  mainly  focussed. 

3.2.  The  Pearson  and  Johnson  distribution  comparisons. 

The  main  comparison,  set  out  in  Tables  A1-A10,  has  been  made  between 
distributions  of  the  Pearson  and  the  Johnson  systems,  both  of  which  can  be  found 
at  every  beta  point  contained  in  the  field  of  Fig.  1.  The  Johnson  curves  are 
divided  into  two  types,  Sy  and  Sg,  their  appropriate  regions  being  separated  by 
the  log-normal  line,  defined  by  the  parametric  equations 

g,  = (oj-1)  (oo+2) 2 (/g,  > 0)  ' 

432  (1) 

$2  ~ <*>  + + 3to  - 3 

Because  it  was  only  necessary  to  read,  without  any  interpolation,  from  exist- 
ing tables  of  standardized  percentage  points  of  these  systems,  their  comparison 
has  been  carried  out,  with  certain  exceptions,  at  a "grid”  of  beta  points  namely 
for  /gj  = 0.0(0. 3)1. 8,  2.0  and  g£  = 2. 2(0. 4)9.0,  9.8,  10.6,  11.8,  12.6,  13.8. 

The  discussion  of  the  comparisons  given  in  the  tables  will  be  aided  by  a 
simultaneous  study  of  Fig.  1.  We  did  not  carry  the  comparison  far  into  the 
Pearson  Type  I J-curve  area  nor  into  the  region  at  the  bottom  left  hand  comer 
of  the  diagram  where  the  difference  between  the  shape  of  the  Pearson  and  Johnson 
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curves  had  become  so  large  that  comparisons  ceased  to  be  useful.  In  Tables  B 
as  in  A,  the  standardized  percentage  points  for  the  Pearson  distribution  are 
given  at  every  point  of  comparison  to  three  decimal  places,  followed  by  the 
amount  (in  0.001’s)  to  add  to  these  values  to  obtain  those  for  other  systems. 

This  was  done  to  economize  space  and  not  because  the  Pearson  curve  values  were 
regarded  as  necessarily  the  most  important.  Also,  so  as  not  to  overcrowd  the 
diagram,  certain  of  the  grid  points  treated  in  Tables  A and  B have  been  omitted 
in  Fig.  1. 

3.3.  Other  distributions  included. 

These  may  be  classed  in  two  categories  according  to  the  restriction  on 
the  beta  points. 

(a)  In  the  first,  these  points  fall  within  certain  areas  in  Fig.  1,  and  two 
independent  shape  parameters  are  involved:  the  Burr,  the  non -central  t 
and  non-central  x2  distributions  are  in  this  category. 

(b)  In  the  second,  the  points  lie  on  a line,  involving  only  a single  inde- 
pendent shape  parameter:  the  log-normal,  log  x2  and  Weibull  distribu- 
tions are  in  this  category.  Consider  these  restrictions  in  turn. 

3.3.1.  The  Burr  system. 

The  beta  points  available  for  this  system  cover  a very  broad  area 
(Burr,  1973).  See  the  appendix.  The  beta  points  chosen  for  this  study  corres- 
pond to  most  of  the  grid  points  selected  for  Pearson  Types  I,  IV  and  VI  (bell- 
shaped) and  for  Johnson  Sg  and  Sy. 

3.3.2.  The  non-central  t system 

The  beta  points  fall  in  an  area  lying  between  the  axis  of  B2  where  we 
have  central  t (or  Student)  distributions  and  a curved  line  along  which  the  non- 
central parameter,  6,  becomes  infinite.  This  was  shown  by  Merrington  and  Pearson, 


V, 
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(1958),  to  be  the  line  on  which  the  beta  points  for  the  distribution  of  the 
reciprocal  of  x fall.  It  lies  just  below  the  curve 

>)2  = 4(462-3el)  (232-^6)  (2) 

along  which  fall  the  beta  points  of  a Pearson  Type  V,  i.e.  of  a distribution  of 
the  reciprocal  of  x2 • 

Here,  because  of  the  labour  involved  in  deriving  the  two  parameters  v and  6, 

given  /B^  and  » we  did  not  use  the  "grid"  points,  but  found  standardized  per- 

« 

centage  points  for  eight  fairly  widely  dispersed  cases,  seme  of  which  had  already 
been  derived  in  earlier  papers  (e.g.  Pearson,  1963).  (See  Fig.  1 and  Table  B2.) 

3.3.3.  The  non-central  x2  system. 

The  beta  points  fall  (Pearson,  1959,  p.  364)  in  the  region  between 

the  lines 

62  - | Sj  - 3 = ° and  B2  - 1 Bi  - 3 = ° . (3) 

We  have  chosen  three  points  to  examine  in  this  area  (Fig.  1 and  Table  B4). 

3.3.4.  The  log-normal  distribution 

The  parametric  equation  of  the  log-normal  line  has  already  been  given 
in  equation  (1)  above.  As  shown  in  Table  Bl,  we  picked  out  for  study  the  ten 
distributions  for  which  6^  = 3.4,  3. 8(0. 8)8. 6,  9.8,  10.6. 

3.3.5.  The  log  x2  distribution. 

We  have  here  taken  three  beta-points  for  which  among  others  the 


parameters  were  given  by  Bartlett  and  Kendall  (1946).  The  positions  are  shown 
in  Fig.  1 and  the  standardized  percentage  points  in  Table  B5.  The  distributions 
are  negatively  skew,  and  have  been  reversed  in  the  table. 
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3.3.6.  The  Weibull  distribution. 

Harter  and  Dube)'  (1967)  gave,  as  an  Appendix  to  their  Report,  a table 
relating  the  parameter  m (their  M)  to  the  first  eight  standardized  cumulants  of 
the  distribution.  We  have  examined  nine  distributions,  those  with 

m = 1.1,  1.3,  1.5,  1.7,  2.0,  2.5,  3.6,  7.0,  10.0. 

For  m =■  3.6,  /g^  = 0.000,  = 2.717,  i.e.  a normal  distribution  is  not  included 

in  the  system;  for  greater  values  of  m the  distribution  is  negatively  skew,  and 
it  has  therefore  been  reversed  so  that  the  beta  points  for  m = 7.0  and  10.0  can 
be  included  in  our  field  of  study.  The  values  of  /gp  associated  with  these 
nine  values  of  m are  shown  with  the  standardized  percentage  points  in  Table  B3 
and  the  (/gp  g 2 ) points  are  plotted  in  Fig.  1. 


4.  DISCUSSION  OF  THE  NUMERICAL  COMPARISONS  SHOWN  IN  TABLES  A AND  B 


4.1.  The  "cross -over"  points. 

Tables  A and  B would  provide  if  required, 15  points  on  the  standardized 
cumulative  distribution  curves  of  each  of  the  large  number  of  distributions  con- 
sidered. If  these  curves  were  to  be  drawn  for  each  of  the  two  - or  three  - 
distributions  compared  at  a given  (gp  g^)  point,  it  would  be  found  that  they 
would  have  three,  and  sometimes  four  or  even  five  cross-over  points  with  the 
correspondong  Pearson  curve.  The  existence  of  these  crosses  results  from  tying 
down  the  distributions  to  have  common  first  four  moments.  While  we  should  have 
liked  to  give  a diagrammatic  illustration  of  this  property,  in  even  quite  extreme 
'.ases  of  disagi cement  of  the  percentage  points  of,  say,  a Burr  and  a Pearson 
curve,  the  cumulative  curves  lie  too  closely  together,  having  regard  to  the 
distances  between  the  extreme  upper  and  lower  0.251  points,  for  visual  repre- 
sentation to  be  helpful. 


The  arithmetical  results  in  the  Tables,  however,  make  it  possible  to  assess 
with  reasonable  accuracy  where  the  crosses  occur.  Take,  for  example,  the  case 
of  ~ 5.4,  = 0.9  given  in  Table  A4:  it  will  be  seen  that  both  the  Sy  and 

Burr  curves  have  cross-overs  with  the  Pearson  Type  IV,  (a)  near  the  median 
(P  = 0.50),  (b)  between  the  lower  2.51  and  5%  points  (P  = 0.025  and  0.05),  and 
(c)  between  the  upper  101  and  5%  points  (P  = 0.90  and  0.95). 

Though  there  is  much  variation  in  these  cross-overs  from  case  to  case,  probab- 
ly it  is  the  median  and  the  two  51  points  which  most  frequently  occur  in  designat- 
ing their  location. 

V 

Tables  A have  been  arranged  in  pairs,  facing  each  other,  to  make  an  ex- 
tensive survey  of  changes  as  easy  as  possible.  We  can  only  refer  to  a fev.  of  the 
points  suggested  by  this  survey.  The  Sy  and  Sg  regions  are  separated  by  the  log- 
normal line  of  equation  (1) . One  of  the  most  striking  results  brought  out  in  the 
present  study  is  the  closeness  in  agreement  between  the  standardized  percentage 
points  of  the  log-normal  and  the  corresponding  Type  VI  distributions . This  is 
shown  by  the  differences  given  in  Table  Bl;  we  have  not  succeeded  in  finding  a 
mathematical  explanation  of  this  phenomenon.  The  differences  gradually  increase 
as  we  pass  down  the  log-normal  line,  starting  from  the  Normal  point.  If  we  take 
as  an  arbitrary  but  useful  yard-stick  a difference  as  great  as  ±0.010  or  l/100th 
of  the  standard  deviation  it  is  seen  that  this  value  is  only  exceeded:  (a)  at  the 
lower  1%  point  when  reaches  6.2,  (b)  at  the  lower  2.5%  point  when  $2  reaches 
7.8  and  (c)  has  only  just  got  there  at  the  lower  5%  point  where  our  table  cuts 


jarison  of  the  Pearson,  log-normal  and  Johnson  distributions  (Sy  and 


off  the  line  at  &2  ~ 10*6.  For  the  corresponding  upper  percentage  points  a 
difference  of  0.010  is  not  reached  at  all.  As  elsewhere,  agreement  is  less  satis- 


factory  in  the  lower  steep  tails  of  the  distributions  than  at  the  upper,  drawn- 
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If  we  move  "south-westwards"  from  the  log-normal  line  into  the  Sy  area  we 
find  that  the  differences  Sy-PC  gradually  increase  until  they  become  really 
large,  particularly  in  the  lower  tail.  Using  the  same  0.010  difference  yard- 
stick, we  find  the  results  shown  in  Table  2. 


TABLE  2 . Absolute  differences  between  standardized 
5%  points  of  Sy  and  corresponding  Pearson 
curve  exceeding  0.010. 


When 

At  lower  tail 

At  upper  tail 

/Bj  = 0.0 

when  > 7.8 

B2  > 7.8 

0.3 

> 7.0 

> 8.6 

0.6 

> 5.6 

> 9.0 

0.9 

> 5.8 

> 9.6 

1.2 

> 7.4 

> 9.8 

1.5 

> 10.4 

>10.0 

1.8 

outside  tables 

If  these  limiting  points  were  inserted  in  Fig.  1 they  would  include  a large 
part  of  the  Sy  area. 

Moving  "north-eastwards"  from  the  log-normal  line,  the  differences  between 
the  Sg  and  the  Pearson  curve  distribution  increase  rapidly  but  as  pointed  out 
above  the  position  would  look  rather  different  if  the  beta-points  of  the  tabu- 
lation had  increased  by  equal  steps  in  6^  rather  than  /8^.  Also,  as  the  boundary 
for  Pearson  Type  I J-curves  is  approached  agreement  in  the  lower  tail  is  hardly 
to  be  expected.  In  the  lower  half  of  the  distributions  agreement  is  much  better, 
e.g.  for  B2  = /Bj  = 0*9  in  Table  A2. 

4.3.  Comparison  of  the  Burr  with  the  Pearson  and  Johnson  distributions. 

One  of  the  most  notable  characteristics  seen  in  Tables  A is  that  the 
differences  between  the  standardized  percentage  points  of  (a)  Johnson  and  Pearson 
distributions  and  (b)  Burr  and  Pearson  distributions  at  a given  beta  point  are  of 
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„ opposite  sign.  While  it  is  not  wise  to  generalize  without  detailed  analysis,  it 

seems  that  towards  the  Normal  point  and  the  log-normal  line,  the  differences 
(b)  are  larger,  often  much  larger,  than  the  differences  (a). 

One  interesting  set  of  comparisons  was  made  for  us  by  Mr.  N.W.  Please  at 
= 10.8635,  /g^  = 2.0,  a point  lying  on  the  log-normal  line,  but  not  included 
in  Tables  A or  Bl.  Judging  from  the  differences  LN  - PC  when  62  = 10.6, 

/gj  = 1.969  given  in  the  latter  table,  we  should  expect  fairly  small  differences 
at  Please' s beta  point,  except  perhaps  for  the  lower  2.5,  1.0,  0.5  and  0.251 
points.  He  computed  the  standardized  moments  pr/o  for  r = 3(1)8,  with  results 
shown  in  Table  3. 


j* 

TABLE  3.  Comparison  of  standardized  moments,  yr/o  , 
of  three  distributions  having  $2  = 10.86, 
/gx  = 2.00. 


■ 

Pearson  Type  IV 

Log-normal 

Burr* 

3 

2.00 

2.00 

2.00 

4 

10.8635 

10.8635 

10.8635 

5 

71.84 

69.96 

75.12 

6 

705.2 

638.9 

844.9 

7 

10,209.3 

7,859.9 

18,089.4 

8 

235,007.3 

129,791.6 

2,500,459.8 

Note  the  way  in  which  the  moment  ratios  for  the  Type  VI  and  log-normal  keep  re- 
latively close  together,  while  those  for  the  Burr  distribution  shoot  off  in  an 
opposite  direction,  as  r increases. 

Clearly  there  will  be  distributions,  observational  or  theoretical,  better 
fitted  by  Burr  curves  than  by  the  perhaps  more  wide]  • used  Pearson  and  Johnson 

curves,  but  it  is  not  known  how  far  this  matter  has  been  explored. 

IT 


Figures  derived  by  Mr.  Please  from  Gruska  et  al.  (1973). 
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4.4.  Comparison  of  non-central  t (say  t')  with  Pearson  Type  IV  distributions. 

The  eight  comparisons  made  in  Table  B2  amply  confirm  Merrington  and 
Pearson's  (1958)  finding  of  the  close  agreement  between  the  distributions  of 
t'  and  Type  IV,  having  the  same  first  four  moments.  With  the  exception  of  the 

single  case  where  82  = 12.219,  /8j  = 1.732  (v  = 6,  6 = 2.65)  the  differences  are 

surprisingly  small,  on  the  whole  indeed  smaller  than  those  for  the  log-normal 
and  Type  VI  distributions  shown  in  Table  Bl.  From  the  mathematical  aspect 
Merrington  and  Pearson  pointed  out  that  the  p.d.f.  of  t'  contained  the  factor 
(1  + t^/v)  while  that  of  Type  IV  contained  the  factor  (1  + x^/a^)  m. 

4.5.  Comparison  of  non-central  x2  with  Pearson  Type  I distributions. 

Three  comparisons  are  made  in  Table  B4.  For  the  first  case  where 
$2  = 3.296,  /8j  = 0.468  and  the  parameters  v = 6,  A = 6 are  of  medium  size 
the  differences  (non-central  x2'PC)  are  small;  here,  the  beta-point  is  not  far 
from  the  Normal  point. 

If  only  3 beta  points  were  to  be  included,  looking  back  it  is  that  the 

second  and  third  cases  were  not  very  well  chosen  since  with  v = 1 and  2 the 

distributions  of  non-central  x2  are  very  skew.  For  central  x2*  with  X = 0, 
the  distributions  are  J-shaped,  with  8^  = 8.0,  82  = 15.0  when  v = 1,  and  8^  = 4.0, 
82  = 9.0  (the  exponential)  when  v = 2.  It  is  not  surprising  therefore  that  the 
differences  (non-centr<»i  y2-PC)  are  so  large. 

4.6.  Comparison  of  Weibull  with  Pearson  and  Sg  distributions. 

Table  B3  shows  that  the  absolute  values  of  the  differences  (W-PC) 


between  the  lower  and  upper  51  points  do  not  exceed  the  yard  stick  0.010  for 
the  parameter  m $ 2.0  and  when  m > 2.0  the  largest  difference  is  0.014. 


lJT 
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What  evidence  there  is  suggests  that  a Weibull  is  closer  than  an  Sg  to 
a Pearson  Type  I distribution. 

(a)  Compare  the  differences  (W-PC)  for  £$2  = 3.772,  /g^  = 0.865  in  Table  B3 
with  those  for  (Sg-PC)  at  the  neighbouring  grid  point  given  in  Table  A2. 

(b)  To  test  this  further  we  have  made  the  following  special  comparison 
between  the  standardized  percentage  points  of  the  distributions: 

Weibull  (m  = 1.3)  and  Pearson  curve,  both  with  = 5.432,  /gj  = 1.346 
(see  Table  B3)  and  Sg  with  = 5.40,  /gj  =1.35.  We  find  the  figures 
in  Table  4. 


TABLE  4.  Comparison  of  standardized  percent  points  for  Weibull,  Sg  and 
Pearson  distributions,  having  moment  ratios  in  the  neighborhood 
of  g2  = 5.4,  /gj  = 1.35.  (Differences  in  0.001*5). 


p 

W 

W-PC 

W-Sg 

P 

W 

W-PC 

W-Sg 

0.0025 

-1.275 

42 

133 

0.75 

0.505 

4 

18 

.005 

■■ 

117 

.90 

1.362 

2 

7 

.01 

1 

88 

.95 

1.957 

-2 

-15 

.025 

-1.206 

18 

47 

0.05 

-1.147 

8 

16 

0.975 

2.520 

-8 

-36 

.10 

-1.042 

-2 

.99 

3.229 

-48 

.25 

-0.754 

.995 

3.744 

-39 

0.50 

-0.236 

D 

.9975 

4.243 

-6 

-11 

In  certain  situations  there  are  thought  to  be  physical  reasons  suggesting 
that  variation  will  be  of  Weibull  form.  However,  when  this  is  not  the  case,  and 
we  have  neither  simulation  data  nor  knowledge  of  higher  moment  ratios,  it  would 
seem  that  we  should  make  a choice  from  these  three  distributions,  Weibull,  Sg 
and  Pearson  Type  I,  according  to  sinplicity  in  computation. 
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4.7.  Comparison  of  log  x2  with  Pearson  distributions. 


This  has  been  made  in  Table  B5  at  a selection  of  three  beta-points, 
which  are  seen  from  Fig.  1 to  lie  very  close  to  the  curve  dividing  the  Type  VI 
and  Type  IV  areas.  Agreement  is  excellent  when  the  degrees  of  freedom  of  x2 
are  v = 10  and  4.  When  v = 2 the  correspondence  deteriorates  outside  the  2.5% 
points.  Note  that  the  distributions  of  log  x2  are  negatively  skew,  and  there- 
fore the  position  of  the  tails  has  been  reversed  in  the  table. 


5.  CONCLUSION:  ILLUSTRATIONS  OF  APPLICATIONS 

5.1.  The  percentage  points  of  the  range  (w)  in  samples  from  a Normal  population. 

On  page  4 it  was  described  how  Pearson  curves  having  approximately 
correct  values  of  were  used  in  1932  to  estimate  the  positions  of  certain 

percentage  points  of  w.  Had  Johnson's  Sg  system  been  developed  at  that  date  it 
would  have  been  realized  that  an  Sg  curve  was  a possible  alternative,  approxi- 
mating distribution  to  use.  Then,  the  percentage  points  of  w,  say  at  n = 3 
to  12  could  have  been  calculated  and  compared  for  both  Type  I and  Sg  systems. 

To  two  decimal  places  the  differences  might  have  been  slight  and  this  would  per- 
haps have  given  increased  confidence  in  whatever  final  values  were  adopted  and 
used,  e.g.  in  industrial  quality  control  problems.  It  would  however  only  have 
been  possible  to  decide  whether  a Type  I or  Sg  approximation  (if  they  differed) 
was  the  more  accurate,  when  the  true  values  were  derived  by  direct  computation 
(Pearson  and  Hartley,  1942). 

4 

5.2.  The  distribution  of  b.,  = m./s  in  sanroles  from  a Normal  population. 


This  interesting  problem  on  which  a considerable  amount  of  attention 
has  been  focussed  for  nearly  50  years  shows  how  in  spite  of  the  knowledge  of  the 
true  sampling  moments  of  \>2  up  to  the  sixth  and  the  collection  of  literally  tens 
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of  thousands  of  simulated  samples,  no  completely  acceptable  answer  has  been  found. 
(The  (Bp  B2)  Points  t>2  for  large  values  of  n were  plotted  in  a chart  published 
by  Pearson  (1963,  p.  106).  These  were  largely  derived  from  an  unpublished 
PhD  thesis  by  C.T.  Hsu  (1939).  The  numerical  values  for  eight  values  of  n are 
given  in  columns  2 and  3 of  the  accompanying  Table  5.  For  some  of  these  cases 
there  are  also  available  the  5th  and/or  6th  order  moment  or  cumulant  ratios.  To 
aid  the  presentation,  the  /Bp  32  points  are  plotted  in  Fig.  2 and  also  some  other 
points  and  bounding  lines. 

Without  lengthy  calculation  which  it  is  hardly  worth  undertaking,  we  cannot 
be  sure,  as  n increases,  exactly  at  what  sample  sizes  the  points (/3j (b2) , B2(b2)) 
cross  over  from  one  "type  region"  to  another.  The  critical  boundaries  are: 

(i)  the  log-normal  line,  separating  Sg  from  S^; 

(ii)  the  Type  V line* (reciprocal  of  x2)  separating  Type  VI  from  Type  IV; 

(iii)  the  reciprocal  of  x line,  the  upper  boundary  of  the  non -central  t 
area. 


TABLE  5.  Data  regarding  the  distribution  of  b2  for  eight  values  of  the  sample 
size,  n. 


n 

*1 

B2 

y5/o5 

v°6 

/ 5 
<5/o 

/ 6 
k6/o 

Possible  approx- 
imating systems 

Source 
of  data 

25 

1.75 

8.90 

46.1 

309 

28.7 

175 

Type  IV,  Sg 

(a) 

40 

1.66 

8.78 

47.6 

352 

31.0 

223 

Type  VI,  Sy 

(a) 

50 

1.5821 

8.4164 

29.30 

(b) 

60 

1.51 

8.03 

42.0 

(a) 

75 

1.4099 

7.4933 

23.48 

Type  IV,  Sy, 

(b) 

100 

1.2772 

6.7740 

31.4 

18.66 

Non -central  t 

(b) 

150 

1.0917 

5.8258 

12.51 

(b) 

200 

0.9677 

5.2487 

18.7 

9.04 

(b) 

References:  (a)  Pearson  (1963)  pp.  105-8,  (b)  Pearson  (1965),  p.  284. 


It  has  been  established  by  computation  that"  the  beta-point 
across  this  boundary,  i.e.  in  the  non-central  t area. 


7ft  0 9 05  ll3  3 

that  the  beta-point  for  n = 50  falls  just 
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On  the  basis  of  this  diverse  information  and  assisted  by  extensive  simu- 
lation sampling, Pearson  and  D'Agostino  (1973)  proceeded  as  described  on  pp.  614- 
18  of  their  paper  and  produced  the  contour  charts  of  probability  levels  for  b2 
displayed  on  pp.  615,  616. 

We  have  spent  so  much  time  on  this  illustration  partly  to  warn  the  statistician 
that  in  spite  of  the  rather  elegant  results  displayed  in  Table  1,  he  must  not  hope 
too  much  from  the  4-moment  method  of  attack.  The  beta-point  approach,  with  a 
study  of  our  Tables  A and  B will  undoubtedly  often  provide  a method  of  entry  to 
the  process  of  finding  an  approximation  to  a mathematically  unknown  distribution. 
But  the  further  the  beta -point  is  from  that  of  a Normal  curve,  the  more  difficult 
it  will  be  to  find  a solution  in  which  we  can  have  confidence,  particularly  in 
the  tails.  The  moment  results  need  to  be  backed  by  an  extensive  simulation 
programme,  the  extent  of  which  unfortunately  may  be  found  to  be  prohibitive. 
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APPENDIX 

Notes  regarding  the  families  of  frequency  curves  compared  in  this 
paper  and  tables  which  have  been  useful  to  us  in  determining 
standardized  percentage  points.  (N.B.  B.T.S.  2 stands  for  Pearson 
and  Hartley's  Biometrika  Tables  for  Statisticians , Vol.  2 (1972). 

Pearson  curves. 

The  equations  of  the  main  curves  are  listed  on  p.  77  of  the  Introduction 
to  B.T.S.  2,  and  the  standardized  percentage  points  are  given  in  Table  32  of  that 
volume  to  arguments  /Bp  B2-  A rather  fuller  table  of  these  points,  used  in  check- 
ing and  expanding  this  Table  32,  was  computed  by  Amos  and  Daniel,  Sandia  Labor- 
atories Report  (1971),  No.  SC-RR-71-0348. 

Johnson  Sy. 

The  distribution  of  X when 

Z = y + 6 sinh_1{(X-£)/A}  (6, A > 0) 

is  a unit  normal  variable. 

B.T.S.  2 gives  values  of  -y  (Table  34)  and  6 (Table  35)  to  arguments  /BpBp 
The  second  impression  (1976)  contains  a corrected  Table  34.  A table  of  standardized 
percentage  points  of  Sp  to  arguments  /BpB2  (corresponding  to  B.T.S.  2 Table  32 
for  Pearson  curves)  was  computed  by  N.L.  Johnson  and  issued  as  No.  408  (1964)  of 

the  Department  of  Statistics  (UNC  Chapel  Hill)  Mimeo  Series. 

Johnson  Sg. 

The  distribution  of  X when 

Z = y + <5  log{(X-£)/(£+A-X)}  = y + 6 log(y/(l-y)}  (6,A>0;  £<X<£+A) 
is  a unit  normal  variable. 

B.T.S.  2,  Table  36  gives  values  of  y.  <$;  yj(y)  and  a(y)  to  arguments  /Bp8r 
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Burr  curves. 

The  cumulative  distribution  function  of  X is 

F(x)  = Pr{ (X<x) } = 1 - (l+xc)"k  , c,k,x  > 0 . 

For  given  c and  k,  one  can  find  the  moments  u,  o,  /Bp  Bp  "Then  for  any  given 
value  of  x,  the  corresponding  standardized  variable  is  y = (x-p)/a.  /Bj  and  B2 
can  be  found  in  terns  of  c and  k,  but  c and  k cannot  be  found  explicitly  in  terms 
of  desired  /B^  and  Bp  Thus  successive  approximation  is  needed.  Burr  (1973)  pro- 
vides a wide  coverage  of  c,k  for  given  /Bp  Bp  Moreover  further  coverage  into 
relatively  low  B2’s  can  be  made  through  letting  c be  negative.  Thus 

G(x)  = Pr((X<x)}=  (l+x"c)’k  c,k,x  > 0 . 

These  two  families  of  distribution  functions  together  cover  an  extremely  wide  area 
of  /Bp  B2  combinations. 


Certain  percentage  points  of  x'  are  given  for  arguments  v and  /A  in  B.T.S.  2, 
Tables  24  and  29.  This  table  was  derived  from  a table  of  percentage  points  of 
non-central  x2  distributions,  computed  by  N.L.  Johnson  and  issued  as  No.  568  (1968) 

of  the  Department  of  Statistics  (UNC  Chapel  Hill)  Mimeo  Series. 

Note  that 

Mean(x')2  = v + X,  Var(x')2  = 2(v  + 2X)  . 


Weibull  curves. 

The  cunulative  distribution  function  of  x is 

F(x|m)  = Pr{X>x|m}  = 1 - exp[-xm]  , (m  > 0,  x > 0)  . 

For  given  values  of  m and  F(x|m)  the  percentage  points  of  x can  be  found  by  inversion 
of  this  equation.  Harter  and  Dubey  (see  main  list  of  References  for  fuller 

details  of  their  Report ) have  given  in  an  appendix  the  mean,  variance,  82 

and  the  5th  14)  to  the  8th  standardised  cumulants  of  x for  m = 1.1(0.1)10.0. 
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Standardized  % points  of  P.C.  with  82, /81  specified,  also  amounts  to  add  to  obtain  Johnson  and  Burr  values 


TABLE  A. 9.  Standardized  % points  of  P.C.  with  62*  ^81  specified;  also  amounts  to  add  to  obtain  Johnson  and  Burr  values 
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Comparison  of  I points  of  non-central  t (t')  distributions  with  those  of  Pearson  curves  (Type  IV) 
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